Loading...

Data Migration Specialist – SQL DWH to Cloudera CDP

Location: Delhi NCR, India

Experience: 5 - 8 yrs

Job Type: Full-Time / Contract

Education:

  • UG: B.Tech/B.E. in Computer Science, Data Engineering, Information Systems, or a related field
  • PG: Any Postgraduate (Preferred)

Job Description

Project Role Description: We are seeking an experienced Data Migration Specialist to support migration of Data Warehouse workloads from SQL-based platforms to Cloudera Data Platform (CDP). The candidate will be responsible for migrating database tables, schemas, SQL queries, and ETL workloads to the Cloudera ecosystem (Hive, Impala, Spark) while ensuring performance, accuracy, and scalability. The role requires strong expertise in SQL-based data warehouses, big data technologies, and data migration strategies, along with hands-on experience in query translation, workload migration, and performance tuning.

Key Responsibilities:

Data Warehouse Migration

  • Plan and execute migration of data warehouse tables, schemas, and datasets from SQL-based platforms to Cloudera CDP.
  • Analyze existing DWH structures, dependencies, and workloads before migration.
  • Design strategies for data migration, schema conversion, and workload optimization.

SQL & Query Migration

  • Convert and optimize SQL queries from traditional databases to Hive or Impala compatible queries.
  • Ensure functional equivalence and performance optimization of migrated queries.
  • Validate query outputs to ensure data accuracy and consistency.

Workload Migration

  • Migrate ETL workflows and batch workloads to Spark or Hive-based pipelines.
  • Implement best practices for distributed processing and query performance.
  • Optimize workloads to run efficiently on Cloudera clusters.

Data Validation & Testing

  • Perform data reconciliation and validation between source and target systems.
  • Conduct performance testing and tuning for migrated workloads.
  • Ensure data quality and integrity post migration.

Collaboration & Support

  • Work closely with data engineers, architects, and Cloudera administrators.
  • Support teams during migration planning, testing, and go-live phases.
  • Provide troubleshooting support for migration-related issues.

Documentation

  • Document migration strategies, transformation logic, and query conversion rules.
  • Maintain documentation for data lineage, mapping, and validation processes.
Qualifications:
  • Bachelor's degree in Computer Science, Data Engineering, Information Systems, or related field.
  • 5–8 years of experience in Data Warehousing, SQL development, and migration projects.
Required Skills:
  • Strong expertise in SQL and relational data warehouse platforms.
  • Hands-on experience with Cloudera Data Platform (CDP), Hive, and Impala.
  • Experience migrating tables, schemas, queries, and workloads to big data environments.
  • Knowledge of Spark for data processing and transformation.
  • Experience with data validation, reconciliation, and performance tuning.
  • Understanding of distributed data processing architectures.
Preferred Skills:
  • Experience migrating from platforms such as Teradata, Oracle, SQL Server, or other enterprise DWH systems to Hadoop/Cloudera.
  • Familiarity with data pipeline orchestration tools such as Airflow.
  • Knowledge of data warehouse design and lakehouse architectures.
  • Experience with automation or scripting (Python, Shell) for migration activities.

Why Choose Us

We're Best in Data Industry with 10 Years of Experience

We’re leaders in the data industry with over 10 years of experience, delivering innovative data solutions that drive business transformation. Our expertise in data pipeline creation has empowered various clients across industries to harness the full potential of their data. For a global fintech firm, we built real-time data pipelines enabling instant fraud detection and risk monitoring. For a leading retail company, we developed scalable pipelines for real-time sales and inventory tracking. Additionally, for a healthcare provider, we created pipelines for secure, real-time patient data processing, improving care and compliance.

Real time Data Ingestion
Batch Data Ingestion
Event Handling on Moving data

21

Happy Clients

84

Project Complete

Data Migration Specialist Job